Cfr10I (3370)
BsaI (3351)
ColE1 Ori
HaeII (2645)
PciI (2397)
BlnI (206)
SfiI (260)
SpeI (440)
CMVp/Enhancer/Intron A
EagI (1096)
EclXI (1096)
KspI (1096)

SacII (1096)
XmaIII (1096)
Ppu10I (1190)
NsiI (1194)
Bpu1102I (1260)
W 1(1303)
Intron A
AflII (1768)
BfrI (1768)
PvuII (1799)
HpaI (1853)
PstI (1963)
SalI (1973)
AccI (1974)
EcoRI (1983)
PaeR7I (1992)
XhoI (1992)
XbaI (2002)

AscI (2011)
EcoRV (2022)
BamHI (2028)
MluI (2037)
BclI (2053)
1 TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG
AGCGCGCAAA GCCACTACTG CCACTTTTGG AGACTGTGTA CGTCGAGGGC 

51 GAGACGGTCA CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG 
CTCTGCCAGT GTCGAACAGA CATTCGCCTA CGGCCCTCGT CTGTTCGGGC 

101 TCAGGGCGCG TCAGCGGGTG TTGGCGGGTG TCGGGGCTGG CTTAACTATG 
AGTCCCGCGC AGTCGCCCAC AACCGCCCAC AGCCCCGACC GAATTGATAC

HindIII



151 CGGCATCAGA GCAGATTGTA CTGAGAGTGC ACCATATGAA GCTTTTTGCA 
GCCGTAGTCT CGTCTAACAT GACTCTCACG TGGTATACTT CGAAAAACGT 

201 AAAGCCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 
TTTCGGATCC GGAGGTTTTT TCGGAGGAGT GATGAAGACC TTATCGAGTC 

251 AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA 
TCCGGCTCCG CCGGAGCCGG AGACGTATTT ATTTTTTTTA ATCAGTCGGT 

301 TGGGGCGGAG AATGGGCGGA ACTGGGCGGG GAGGGAATTA TTGGCTATTG 
ACCCCGCCTC TTACCCGCCT TGACCCGCCC CTCCCTTAAT AACCGATAAC 

351 GCCATTGCAT ACGTTGTATC TATATCATAA TATGTACATT TATATTGGCT 
CGGTAACGTA TGCAACATAG ATATAGTATT ATACATGTAA ATATAACCGA 

401 CATGTCCAAT ATGACCGCCA TGTTGACATT GATTATTGAC TAGTTATTAA 
GTACAGGTTA TACTGGCGGT ACAACTGTAA CTAATAACTG ATCAATAATT 

451 TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG 
ATCATTAGTT AATGCCCCAG TAATCAAGTA TCGGGTATAT ACCTCAAGGC 

501 CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 
GCAATGTATT GAATGCCATT TACCGGGCGG ACCGACTGGC GGGTTGCTGG 

551 CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 
GGGCGGGTAA CTGCAGTTAT TACTGCATAC AAGGGTATCA TTGCGGTTAT 

601 GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 
CCCTGAAAGG TAACTGCAGT TACCCACCTC ATAAATGCCA TTTGACGGGT 

651 CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTCCGCCC CCTATTGACG 
GAACCGTCAT GTAGTTCACA TAGTATACGG TTCAGGCGGG GGATAACTGC 

701 TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA 
AGTTACTGCC ATTTACCGGG CGGACCGTAA TACGGGTCAT GTACTGGAAT 

751 CGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 
GCCCTGAAAG GATGAACCGT CATGTAGATG CATAATCAGT AGCGATAATG 

801 CATGGTGATG CGGTTTTGGC AGTACACCAA TGGGCGTGGA TAGCGGTTTG 
GTACCACTAC GCCAAAACCG TCATGTGGTT ACCCGCACCT ATCGCCAAAC 

851 ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 
TGAGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT ACCCTCAAAC 
.0. ^So«cT»"^^''*

GGGCAACTGC GTTTA^v, ^^^r^cG CCATCCACGC 

CGTCTCGAGC ARATCACIi ^,^^^cC TCCGCGGCCG 

.0. 

ACAARACTGG AGGTATCT .^^cAAGAGT GACGTAAGTA 

..0. SSSS c.c.™« 

CCTTGCCACG TRACCTTG ^.pprCTTAT GCATGCTATA 

SSSS SSS.™ 

GACAAAAACC GAACCL^ „,,^p.TTA TTGACCACTC 

— SSS ISSSi s^-- — 

GGGATAACCA CTGCTA ^^cTCTGTCC TTCAGAGACT 

SSS^ SSSi I^Skc- 

ggtgttgata gagataac ,,^catttat tatttacaaa 

CTGTGCCTGA GACATAi^ ^^^^rTTI TTATTAAACA 

AAGTGTATAT GTTGTTt, ^^^t-ccgGAC ATGGGCTCTT 

..s. SSI- ?s-»- 

GAGGCCATCG CCGCCl^ ,,,^,rT GGAGGCCAGA 

CGCCGAGTAC CAGCGAt.^^ ^^^^-rC ACAAGGCCGT 

CCTGCGTCTA CCTTCT ^.^ptCCCGT TGCGGTGCTG 



Appln. No. 10/715,665 
Replacement Sheet 



1851 TTAACGGTGG AGGGCAGTGT AGTCTGAGCA GTACTCGTTG CTGCCGCGCG 
AATTGCCACC TCCCGTCACA TCAGACTCGT CATGAGCAAC GACGGCGCGC 

1901 CGCCACCAGA CATAATAGCT GACAGACTAA CAGACTGTTC CTTTCCATGG 
GCGGTGGTCT GTATTATCGA CTGTCTGATT GTCTGACAAG GAAAGGTACC 

SalI EcoRI XhoI



1951 GTCTTTTCTG CAGTCACCGT CGTCGACCTA AGAATTCAGA CTCGAGCAAG 
CAGAAAAGAC GTCAGTGGCA GCAGCTGGAT TCTTAAGTCT GAGCTCGTTC 

XbaI AscI EcoRV BamHI MluI



2001 TCTAGAAAGG CGCGCCAAGA TATCAAGGAT CCACTACGCG TTAGAGCTCG 
AGATCTTTCC GCGCGGTTCT ATAGTTCCTA GGTGATGCGC AATCTCGAGC 

2051 CTGATCAGCC TCGACTGTGC CTTCTAGTTG CCAGCCATCT GTTGTTTGCC
GACTAGTCGG AGCTGACACG GAAGATCAAC GGTCGGTAGA CAACAAACGG 

2101 CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG GTGCCACTCC CACTGTCCTT 
GGAGGGGGCA CGGAAGGAAC TGGGACCTTC CACGGTGAGG GTGACAGGAA 

2151 TCCTAATAAA ATGAGGAAAT TGCATCGCAT TGTCTGAGTA GGTGTCATTC 
AGGATTATTT TACTCCTTTA ACGTAGCGTA ACAGACTCAT CCACAGTAAG 

2201 TATTCTGGGG GGTGGGGTGG GGCAGGACAG CAAGGGGGAG GATTGGGAAG 
ATAAGACCCC CCACCCCACC CCGTCCTGTC GTTCCCCCTC CTAACCCTTC 

2251 ACAATAGCAG GCATGCTGGG GAGCTCTTCC GCTTCCTCGC TCACTGACTC
TGTTATCGTC CGTACGACCC CTCGAGAAGG CGAAGGAGCG AGTGACTGAG 

2301 GCTGCGCTCG GTCGTTCGGC TGCGGCGAGC GGTATCAGCT CACTCAAAGG 
CGACGCGAGC CAGCAAGCCG ACGCCGCTCG CCATAGTCGA GTGAGTTTCC 

2351 CGGTAATACG GTTATCCACA GAATCAGGGG ATAACGCAGG AAAGAACATG 
GCCATTATGC CAATAGGTGT CTTAGTCCCC TATTGCGTCC TTTCTTGTAC 

2401 TGAGCAAAAG GCCAGCAAAA GGCCAGGAAC CGTAAAAAGG CCGCGTTGCT
ACTCGTTTTC CGGTCGTTTT CCGGTCCTTG GCATTTTTCC GGCGCAACGA 

2451 GGCGTTTTTC CATAGGCTCC GCCCCCCTGA CGAGCATCAC AAAAATCGAC 
CCGCAAAAAG GTATCCGAGG CGGGGGGACT GCTCGTAGTG TTTTTAGCTG 

2501 GCTCAAGTCA GAGGTGGCGA AACCCGACAG GACTATAAAG ATACCAGGCG
CGAGTTCAGT CTCCACCGCT TTGGGCTGTC CTGATATTTC TATGGTCCGC 

2551 TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT CCTGTTCCGA CCCTGCCGCT 
AAAGGGGGAC CTTCGAGGGA GCACGCGAGA GGACAAGGCT GGGACGGCGA 

2601 TACCGGATAC CTGTCCGCCT TTCTCCCTTC GGGAAGCGTG GCGCTTTCTC 
ATGGCCTATG GACAGGCGGA AAGAGGGAAG CCCTTCGCAC CGCGAAAGAG 

2651 AATGCTCACG CTGTAGGTAT CTCAGTTCGG TGTAGGTCGT TCGCTCCAAG 
TTACGAGTGC GACATCCATA GAGTCAAGCC ACATCCAGCA AGCGAGGTTC 

2701 CTGGGCTGTG TGCACGAACC CCCCGTTCAG CCCGACCGCT GCGCCTTATC
GACCCGACAC ACGTGCTTGG GGGGCAAGTC GGGCTGGCGA CGCGGAATAG 



FIG. 1D



r^m^r TTATCGCCAC 

.s. . ^sss 

ACCGTCGTCG GTGACCATT ^.p^^cTACA CTAGRAGGAC 

""^^^ cgatgtctca agaacttcac ^,,^pttc GGAAAAAGAG 

.0. ^SSS rcSSS — 

TCATAAACCA TAGACGCGAC, ^^tAG CGGTGGTTTT 

AAACAAACGT TCGTCGTCTA .^^qtGGAAC GAAAACTCAC 

.c^- ss^s sss sss- 

AGGAAACTAG AAAAGATGCC ..;.qgATCTT cacctagatc 

3.0. ^ SSS SSSS — = 

CAATTCCCTA AAACCAGTAC .,.^^;,GTA TATATGAGTA 

GAAAATTTAA TTTTTACl ^.^^.g^qGCA CCTATCTCAG 

3,03 ^-SSS«--^-'=°'^^. 
TTGAACCAGA CTGTCAAi ^^^^f^TCCC CGTCGTGTAG 

3.3 sssr.ssssssss—- 

GCTAGACAGA TAAAGCAAGT «,^cCCAGTG CTGCAATGAT 

3303 — ».^.-SSSSS»CO.C=™C.. 

tattgatgct ATGCCCTO^v, „,^^.gca ataaaccagc 

33S3 .CC3C.3-C gSSS " ^ 

"'^ Wgctctg GGTGCGAGTG gc ATCCGCCTCC 

3«3 C^^f^S^^^"^^ 

s» -^"^ 

3.03 = iTcSS — ^ 

<3cc=^3" SSS SSU. 

3.3 ^SSISM^--— 
CGCCGCTGGC TCAA^^ T,tCTTCGGGG
GTGTATCGTC TTt, ^rnTCCAGTT CQRTGTAACC 

s^ss ssss ?SS.c. 

GCrrrTGAGA GTTCl-li^ ^^^^Tn»rC ACCAGCGTTT 

5« ''^ 

GACCCACTCG TTTTTG ^..cCTTTTTC AATATTATTG 

CGCTGTGCCT TTACiV^ ^^^t..cATA TTTGAATGTA 

TTCGTAAATA GTCCCAA ^,,^,ttTCC CCGAAAAGTG 

,,,, j-^S SSS = 

FIG. 1F
XmnI (4741)
Asp700 (4741)
BcgI (4681)
PvuI (4512)
FspI (4364)
AviII (4364)
AosI (4364)
Cfr10I (4222)
BsrFI (4222)
BsaI (4203)
Eam1105I (4142)

AspEI (4142)
HaeII (3497)
PciI (3294)
BclI (2905)
MluI (2889)
EcoRV (2874)
AscI (2863)

XhoI (2844)
PaeR7I (2844)
BstZ17I (2825)
Bst1107I (2825)

BstXI (2617)

HindIII (190)
AatI (211)
StuI (211)
SfiI (260)
SnaBI (781)
EagI (1096)
EclXI (1096)
KspI (1096)
SacII (1096)
XmaIII (1096)
Ppu10I (1190)
NsiI (1194)
Bpu1102I (1260)
CelII (1260)
EspI (1260)
AccIII (1535)
BseAI (1535)
BspEI (1535)
MroI (1535)
AflII (1768)
BfrI (1768)
PvuII (1799)
HpaI (1853)
SalI (1973)

PRE S2-SAg ORF
BsgI (2507)
BstAPI (2521)
BspMI (2529)
EcoNI (2536)



FIG. 2A 
SEQ ID NO: 2

1 TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG
AGCGCGCAAA GCCACTACTG CCACTTTTGG AGACTGTGTA CGTCGAGGGC 

51 GAGACGGTCA CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG 
CTCTGCCAGT GTCGAACAGA CATTCGCCTA CGGCCCTCGT CTGTTCGGGC 

101 TCAGGGCGCG TCAGCGGGTG TTGGCGGGTG TCGGGGCTGG CTTAACTATG 
AGTCCCGCGC AGTCGCCCAC AACCGCCCAC AGCCCCGACC GAATTGATAC 

HindIII



151 CGGCATCAGA GCAGATTGTA CTGAGAGTGC ACCATATGAA GCTTTTTGCA 
GCCGTAGTCT CGTCTAACAT GACTCTCACG TGGTATACTT CGAAAAACGT 

StuI 



AatI 



201 AAAGCCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 
TTTCGGATCC GGAGGTTTTT TCGGAGGAGT GATGAAGACC TTATCGAGTC 

SfiI



251 AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA 
TCCGGCTCCG CCGGAGCCGG AGACGTATTT ATTTTTTTTA ATCAGTCGGT 

301 TGGGGCGGAG AATGGGCGGA ACTGGGCGGG GAGGGAATTA TTGGCTATTG 
ACCCCGCCTC TTACCCGCCT TGACCCGCCC CTCCCTTAAT AACCGATAAC 

351 GCCATTGCAT ACGTTGTATC TATATCATAA TATGTACATT TATATTGGCT
CGGTAACGTA TGCAACATAG ATATAGTATT ATACATGTAA ATATAACCGA 

401 CATGTCCAAT ATGACCGCCA TGTTGACATT GATTATTGAC TAGTTATTAA 
GTACAGGTTA TACTGGCGGT ACAACTGTAA CTAATAACTG ATCAATAATT 

451 TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG 
ATCATTAGTT AATGCCCCAG TAATCAAGTA TCGGGTATAT ACCTCAAGGC 

501 CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 
GCAATGTATT GAATGCCATT TACCGGGCGG ACCGACTGGC GGGTTGCTGG 

551 CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 
GGGCGGGTAA CTGCAGTTAT TACTGCATAC AAGGGTATCA TTGCGGTTAT 

601 GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 
CCCTGAAAGG TAACTGCAGT TACCCACCTC ATAAATGCCA TTTGACGGGT 

651 CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTCCGCCC CCTATTGACG 
GAACCGTCAT GTAGTTCACA TAGTATACGG TTCAGGCGGG GGATAACTGC 

701 TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA 
AGTTACTGCC ATTTACCGGG CGGACCGTAA TACGGGTCAT GTACTGGAAT 

SnaBI 



751 CGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 
GCCCTGAAAG GATGAACCGT CATGTAGATG CATAATCAGT AGCGATAATG 
801 CATGGTGATG CGGTTTTGGC AGTACACCAA TGGGCGTGGA TAGCGGTTTG
GTACCACTAC GCCAAAACCG TCATGTGGTT ACCCGCACCT ATCGCCAAAC 

851 ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 
TGAGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT ACCCTCAAAC 



901 TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ATAACCCCGC 
AAAACCGTGG TTTTAGTTGC CCTGAAAGGT TTTACAGCAT TATTGGGGCG

951 CCCGTTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 
GGGCAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC CAGATATATT 

1001 GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG CCATCCACGC 
CGTCTCGAGC AAATCACTTG GCAGTCTAGC GGACCTCTGC GGTAGGTGCG 

XmaIII

SacII 



KspI



EclXI 



EagI 



1051 TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC TCCGCGGCCG 
ACAAAACTGG AGGTATCTTC TGTGGCCCTG GCTAGGTCGG AGGCGCCGGC 

1101 GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT GACGTAAGTA 
CCTTGCCACG TAACCTTGCG CCTAAGGGGC ACGGTTCTCA CTGCATTCAT 

Ppu10I



NsiI



1151 CCGCCTATAG ACTCTATAGG CACACCCCTT TGGCTCTTAT GCATGCTATA 
GGCGGATATC TGAGATATCC GTGTGGGGAA ACCGAGAATA CGTACGATAT 

1201 CTGTTTTTGG CTTGGGGCCT ATACACCCCC GCTCCTTATG CTATAGGTGA
GACAAAAACC GAACCCCGGA TATGTGGGGG CGAGGAATAC GATATCCACT 

EspI



CelII



Bpu1102I



1251 TGGTATAGCT TAGCCTATAG GTGTGGGTTA TTGACCATTA TTGACCACTC
ACCATATCGA ATCGGATATC CACACCCAAT AACTGGTAAT AACTGGTGAG 

1301 CCCTATTGGT GACGATACTT TCCATTACTA ATCCATAACA TGGCTCTTTG 
GGGATAACCA CTGCTATGAA AGGTAATGAT TAGGTATTGT ACCGAGAAAC 

1351 CCACAACTAT CTCTATTGGC TATATGCCAA TACTCTGTCC TTCAGAGACT 
GGTGTTGATA GAGATAACCG ATATACGGTT ATGAGACAGG AAGTCTCTGA 

1401 GACACGGACT CTGTATTTTT ACAGGATGGG GTCCATTTAT TATTTACAAA 
CTGTGCCTGA GACATAAAAA TGTCCTACCC CAGGTAAATA ATAAATGTTT 



FIG. 2C
1451 TTCACATATA CAACAACGCC GTCCCCCGTG CCCGCAGTTT TTATTAAACA
AAGTGTATAT GTTGTTGCGG CAGGGGGCAC GGGCGTCAAA AATAATTTGT 

MroI



BspEI 



BseAI 



AccIII 



1501 TAGCGTGGGA TCTCCGACAT CTCGGGTACG TGTTCCGGAC ATGGGCTCTT 
ATCGCACCCT AGAGGCTGTA GAGCCCATGC ACAAGGCCTG TACCCGAGAA 

1551 CTCCGGTAGC GGCGGAGCTT CCACATCCGA GCCCTGGTCC CATCCGTCCA 
GAGGCCATCG CCGCCTCGAA GGTGTAGGCT CGGGACCAGG GTAGGCAGGT 

1601 GCGGCTCATG GTCGCTCGGC AGCTCCTTGC TCCTAACAGT GGAGGCCAGA
CGCCGAGTAC CAGCGAGCCG TCGAGGAACG AGGATTGTCA CCTCCGGTCT 

1651 CTTAGGCACA GCACAATGCC CACCACCACC AGTGTGCCGC ACAAGGCCGT 
GAATCCGTGT CGTGTTACGG GTGGTGGTGG TCACACGGCG TGTTCCGGCA 

1701 GGCGGTAGGG TATGTGTCTG AAAATGAGCT CGGAGATTGG GCTCGCACCT 
CCGCCATCCC ATACACAGAC TTTTACTCGA GCCTCTAACC CGAGCGTGGA 

BfrI



AflII PvuII



1751 GGACGCAGAT GGAAGACTTA AGGCAGCGGC AGAAGAAGAT GCAGGCAGCT 
CCTGCGTCTA CCTTCTGAAT TCCGTCGCCG TCTTCTTCTA CGTCCGTCGA 

PvuII HpaI

1801 GAGTTGTTGT ATTCTGATAA GAGTCAGAGG TAACTCCCGT TGCGGTGCTG 
CTCAACAACA TAAGACTATT CTCAGTCTCC ATTGAGGGCA ACGCCACGAC 

HpaI



1851 TTAACGGTGG AGGGCAGTGT AGTCTGAGCA GTACTCGTTG CTGCCGCGCG 
AATTGCCACC TCCCGTCACA TCAGACTCGT CATGAGCAAC GACGGCGCGC 

1901 CGCCACCAGA CATAATAGCT GACAGACTAA CAGACTGTTC CTTTCCATGG 
GCGGTGGTCT GTATTATCGA CTGTCTGATT GTCTGACAAG GAAAGGTACC 

+2 SEQ ID NO: Q W N 

SalI



1951 GTCTTTTCTG CAGTCACCGT CGTCGACCTA AGAATTCATG CAGTGGAACT 
CAGAAAAGAC GTCAGTGGCA GCAGCTGGAT TCTTAAGTAT GTCACCTTGA 

+2STAF HQT LQDP RVR GLY 
2001 CCACTGCCTT CCACCAAACT CTGCAGGATC CCAGAGTCAG GGGTCTGTAT 
GGTGACGGAA GGTGGTTTGA GACGTCCTAG GGTCTCAGTC CCCAGACATA 

+2LPAG GSS SGT VNPA PNI 
2051 CTTCCTGCTG GTGGCTCCAG TTCAGGAACA GTAAACCCTG CTCCGAATAT 
GAAGGACGAC CACCGAGGTC AAGTCCTTGT CATTTGGGAC GAGGCTTATA 
+2 ASH ISSI SAR TGD PVT
2101 TGCCTCTCAC ATCTCGTCAA TCTCCGCGAG GACTGGGGAC CCTGTGACGA 
ACGGAGAGTG TAGAGCAGTT AGAGGCGCTC CTGACCCCTG GGACACTGCT 

+ 2NMEN ITS GFLG PLL VLQ 
2151 ACATGGAGAA CATCACATCA GGATTCCTAG GACCCCTGCT CGTGTTACAG 
TGTACCTCTT GTAGTGTAGT CCTAAGGATC CTGGGGACGA GCACAATGTC 

+2 AGFF LLT RIL TIPQ SLD
2201 GCGGGGTTTT TCTTGTTGAC AAGAATCCTC ACAATACCGC AGAGTCTAGA 
CGCCCCAAAA AGAACAACTG TTCTTAGGAG TGTTATGGCG TCTCAGATCT 

+2 SWW TSLN FLG GSP VCL 
2251 CTCGTGGTGG ACTTCTCTCA ATTTTCTAGG GGGATCTCCC GTGTGTCTTG 
GAGCACCACC TGAAGAGAGT TAAAAGATCC CCCTAGAGGG CACACAGAAC 

+2GQNS QSP TSNH SPT SCP 
2301 GCCAAAATTC GCAGTCCCCA ACCTCCAATC ACTCACCAAC CTCCTGTCCT
CGGTTTTAAG CGTCAGGGGT TGGAGGTTAG TGAGTGGTTG GAGGACAGGA 

+ 2PICP GYR WMC LRRF IIF 
2351 CCAATTTGTC CTGGTTATCG CTGGATGTGT CTGCGGCGTT TTATCATATT 
GGTTAAACAG GACCAATAGC GACCTACACA GACGCCGCAA AATAGTATAA 

+ 2 LFI LLLC LIF LLV LLD 
2401 CCTCTTCATC CTGCTGCTAT GCCTCATCTT CTTATTGGTT CTTCTGGATT
GGAGAAGTAG GACGACGATA CGGAGTAGAA GAATAACCAA GAAGACCTAA 

+ 2YQGM LPV CPLI PGS TTT 
2451 ATCAAGGTAT GTTGCCCGTT TGTCCTCTAA TTCCAGGATC AACAACAACC 
TAGTTCCATA CAACGGGCAA ACAGGAGATT AAGGTCCTAG TTGTTGTTGG 

+2STGP CKT CTT PAQG NSM 

BstAPI 



BspMI EcoNI



2501 AGTACGGGAC CATGCAAAAC CTGCACGACT CCTGCTCAAG GCAACTCTAT 
TCATGCCCTG GTACGTTTTG GACGTGCTGA GGACGAGTTC CGTTGAGATA 

BsgI



+ 2 FPS CCCT KPT DGN CTC 
2551 GTTTCCCTCA TGTTGCTGTA CAAAACCTAC GGATGGAAAT TGCACCTGTA 
CAAAGGGAGT ACAACGACAT GTTTTGGATG CCTACCTTTA ACGTGGACAT 

+ 2IPIP SSW AFAK YLW EWA 
BstXI 



2601 TTCCCATCCC ATCGTCCTGG GCTTTCGCAA AATACCTATG GGAGTGGGCC 
AAGGGTAGGG TAGCAGGACC CGAAAGCGTT TTATGGATAC CCTCACCCGG 

+2SVRF SWL SLL VPFV QWF 
2651 TCAGTCCGTT TCTCTTGGCT CAGTTTACTA GTGCCATTTG TTCAGTGGTT 
AGTCAGGCAA AGAGAACCGA GTCAAATGAT CACGGTAAAC AAGTCACCAA 

+ 2 VGL SPTV WLS AIW MMW 
2701 CGTAGGGCTT TCCCCCACTG TTTGGCTTTC AGCTATATGG ATGATGTGGT 
GCATCCCGAA AGGGGGTGAC AAACCGAAAG TCGATATACC TACTACACCA 
+2 YWGP SLY SIVS PFI PLL
2751 ATTGGGGGCC AAGTCTGTAC AGCATCGTGA GTCCCTTTAT ACCGCTGTTA 
TAACCCCCGG TTCAGACATG TCGTAGCACT CAGGGAAATA TGGCGACAAT 

+2PIFF CLW VYI * 

BstZ17I XhoI



Bst1107I PaeR7I



2801 CCAATTTTCT TTTGTCTCTG GGTATACATT TAAGAATTCA GACTCGAGCA 
GGTTAAAAGA AAACAGAGAC CCATATGTAA ATTCTTAAGT CTGAGCTCGT 

AscI EcoRV MluI



2851 AGTCTAGAAA GGCGCGCCAA GATATCAAGG ATCCACTACG CGTTAGAGCT 
TCAGATCTTT CCGCGCGGTT CTATAGTTCC TAGGTGATGC GCAATCTCGA 

BclI



2901 CGCTGATCAG CCTCGACTGT GCCTTCTAGT TGCCAGCCAT CTGTTGTTTG 
GCGACTAGTC GGAGCTGACA CGGAAGATCA ACGGTCGGTA GACAACAAAC

2951 CCCCTCCCCC GTGCCTTCCT TGACCCTGGA AGGTGCCACT CCCACTGTCC
GGGGAGGGGG CACGGAAGGA ACTGGGACCT TCCACGGTGA GGGTGACAGG

3001 TTTCCTAATA AAATGAGGAA ATTGCATCGC ATTGTCTGAG TAGGTGTCAT 
AAAGGATTAT TTTACTCCTT TAACGTAGCG TAACAGACTC ATCCACAGTA 

3051 TCTATTCTGG GGGGTGGGGT GGGGCAGGAC AGCAAGGGGG AGGATTGGGA 
AGATAAGACC CCCCACCCCA CCCCGTCCTG TCGTTCCCCC TCCTAACCCT 

3101 AGACAATAGC AGGCATGCTG GGGAGCTCTT CCGCTTCCTC GCTCACTGAC 
TCTGTTATCG TCCGTACGAC CCCTCGAGAA GGCGAAGGAG CGAGTGACTG 

3151 TCGCTGCGCT CGGTCGTTCG GCTGCGGCGA GCGGTATCAG CTCACTCAAA 
AGCGACGCGA GCCAGCAAGC CGACGCCGCT CGCCATAGTC GAGTGAGTTT

Pci I 

3201 GGCGGTAATA CGGTTATCCA CAGAATCAGG GGATAACGCA GGAAAGAACA 
CCGCCATTAT GCCAATAGGT GTCTTAGTCC CCTATTGCGT CCTTTCTTGT 

Pci I 

3251 TGTGAGCAAA AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG 
ACACTCGTTT TCCGGTCGTT TTCCGGTCCT TGGCATTTTT CCGGCGCAAC 

3301 CTGGCGTTTT TCCATAGGCT CCGCCCCCCT GACGAGCATC ACAAAAATCG 
GACCGCAAAA AGGTATCCGA GGCGGGGGGA CTGCTCGTAG TGTTTTTAGC

3351 ACGCTCAAGT CAGAGGTGGC GAAACCCGAC AGGACTATAA AGATACCAGG
TGCGAGTTCA GTCTCCACCG CTTTGGGCTG TCCTGATATT TCTATGGTCC

3401 CGTTTCCCCC TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC GACCCTGCCG 
GCAAAGGGGG ACCTTCGAGG GAGCACGCGA GAGGACAAGG CTGGGACGGC 



FIG. 2F
HaeII



3451 CTTACCGGAT ACCTGTCCGC CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC 
GAATGGCCTA TGGACAGGCG GAAAGAGGGA AGCCCTTCGC ACCGCGAAAG 

3501 TCAATGCTCA CGCTGTAGGT ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA 
AGTTACGAGT GCGACATCCA TAGAGTCAAG CCACATCCAG CAAGCGAGGT 

3551 AGCTGGGCTG TGTGCACGAA CCCCCCGTTC AGCCCGACCG CTGCGCCTTA 
TCGACCCGAC ACACGTGCTT GGGGGGCAAG TCGGGCTGGC GACGCGGAAT 

3601 TCCGGTAACT ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC
AGGCCATTGA TAGCAGAACT CAGGTTGGGC CATTCTGTGC TGAATAGCGG 

3651 ACTGGCAGCA GCCACTGGTA ACAGGATTAG CAGAGCGAGG TATGTAGGCG 
TGACCGTCGT CGGTGACCAT TGTCCTAATC GTCTCGCTCC ATACATCCGC 

3701 GTGCTACAGA GTTCTTGAAG TGGTGGCCTA ACTACGGCTA CACTAGAAGG 
CACGATGTCT CAAGAACTTC ACCACCGGAT TGATGCCGAT GTGATCTTCC 

3751 ACAGTATTTG GTATCTGCGC TCTGCTGAAG CCAGTTACCT TCGGAAAAAG
TGTCATAAAC CATAGACGCG AGACGACTTC GGTCAATGGA AGCCTTTTTC 

3801 AGTTGGTAGC TCTTGATCCG GCAAACAAAC CACCGCTGGT AGCGGTGGTT
TCAACCATCG AGAACTAGGC CGTTTGTTTG GTGGCGACCA TCGCCACCAA 

3851 TTTTTGTTTG CAAGCAGCAG ATTACGCGCA GAAAAAAAGG ATCTCAAGAA
AAAAACAAAC GTTCGTCGTC TAATGCGCGT CTTTTTTTCC TAGAGTTCTT 

3901 GATCCTTTGA TCTTTTCTAC GGGGTCTGAC GCTCAGTGGA ACGAAAACTC
CTAGGAAACT AGAAAAGATG CCCCAGACTG CGAGTCACCT TGCTTTTGAG 

3951 ACGTTAAGGG ATTTTGGTCA TGAGATTATC AAAAAGGATC TTCACCTAGA
TGCAATTCCC TAAAACCAGT ACTCTAATAG TTTTTCCTAG AAGTGGATCT 

4001 TCCTTTTAAA TTAAAAATGA AGTTTTAAAT CAATCTAAAG TATATATGAG 
AGGAAAATTT AATTTTTACT TCAAAATTTA GTTAGATTTC ATATATACTC 

4051 TAAACTTGGT CTGACAGTTA CCAATGCTTA ATCAGTGAGG CACCTATCTC 
ATTTGAACCA GACTGTCAAT GGTTACGAAT TAGTCACTCC GTGGATAGAG 



4101 AGCGATCTGT CTATTTCGTT CATCCATAGT TGCCTGACTC CCCGTCGTGT
TCGCTAGACA GATAAAGCAA GTAGGTATCA ACGGACTGAG GGGCAGCACA

4151 AGATAACTAC GATACGGGAG GGCTTACCAT CTGGCCCCAG TGCTGCAATG
TCTATTGATG CTATGCCCTC CCGAATGGTA GACCGGGGTC ACGACGTTAC



Eam1105I



AspEI
Cfr10I



BsrFI 



4201 ATACCGCGAG ACCCACGCTC ACCGGCTCCA GATTTATCAG CAATAAACCA 
TATGGCGCTC TGGGTGCGAG TGGCCGAGGT CTAAATAGTC GTTATTTGGT 
BsaI



4251 GCCAGCCGGA AGGGCCGAGC GCAGAAGTGG TCCTGCAACT TTATCCGCCT 
CGGTCGGCCT TCCCGGCTCG CGTCTTCACC AGGACGTTGA AATAGGCGGA 



4301 CCATCCAGTC TATTAATTGT TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA 
GGTAGGTCAG ATAATTAACA ACGGCCCTTC GATCTCATTC ATCAAGCGGT 

FspI



AviII



AosI



4351 GTTAATAGTT TGCGCAACGT TGTTGCCATT GCTACAGGCA TCGTGGTGTC 
CAATTATCAA ACGCGTTGCA ACAACGGTAA CGATGTCCGT AGCACCACAG 

4401 ACGCTCGTCG TTTGGTATGG CTTCATTCAG CTCCGGTTCC CAACGATCAA 
TGCGAGCAGC AAACCATACC GAAGTAAGTC GAGGCCAAGG GTTGCTAGTT 

4451 GGCGAGTTAC ATGATCCCCC ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC 
CCGCTCAATG TACTAGGGGG TACAACACGT TTTTTCGCCA ATCGAGGAAG 

PvuI



4501 GGTCCTCCGA TCGTTGTCAG AAGTAAGTTG GCCGCAGTGT TATCACTCAT 
CCAGGAGGCT AGCAACAGTC TTCATTCAAC CGGCGTCACA ATAGTGAGTA 

4551 GGTTATGGCA GCACTGCATA ATTCTCTTAC TGTCATGCCA TCCGTAAGAT 
CCAATACCGT CGTGACGTAT TAAGAGAATG ACAGTACGGT AGGCATTCTA 

4601 GCTTTTCTGT GACTGGTGAG TACTCAACCA AGTCATTCTG AGAATAGTGT 
CGAAAAGACA CTGACCACTC ATGAGTTGGT TCAGTAAGAC TCTTATCACA 

BcgI



4651 ATGCGGCGAC CGAGTTGCTC TTGCCCGGCG TCAATACGGG ATAATACCGC 
TACGCCGCTG GCTCAACGAG AACGGGCCGC AGTTATGCCC TATTATGGCG 

XmnI



Asp700 



4701 GCCACATAGC AGAACTTTAA AAGTGCTCAT CATTGGAAAA CGTTCTTCGG 
CGGTGTATCG TCTTGAAATT TTCACGAGTA GTAACCTTTT GCAAGAAGCC 

4751 GGCGAAAACT CTCAAGGATC TTACCGCTGT TGAGATCCAG TTCGATGTAA 
CCGCTTTTGA GAGTTCCTAG AATGGCGACA ACTCTAGGTC AAGCTACATT

4801 CCCACTCGTG CACCCAACTG ATCTTCAGCA TCTTTTACTT TCACCAGCGT 
GGGTGAGCAC GTGGGTTGAC TAGAAGTCGT AGAAAATGAA AGTGGTCGCA 
4851 TTCTGGGTGA GCAAAAACAG GAAGGCAAAA TGCCGCAAAA AAGGGAATAA
AAGACCCACT CGTTTTTGTC CTTCCGTTTT ACGGCGTTTT TTCCCTTATT 

4901 GGGCGACACG GAAATGTTGA ATACTCATAC TCTTCCTTTT TCAATATTAT 
CCCGCTGTGC CTTTACAACT TATGAGTATG AGAAGGAAAA AGTTATAATA 

4951 TGAAGCATTT ATCAGGGTTA TTGTCTCATG AGCGGATACA TATTTGAATG 
ACTTCGTAAA TAGTCCCAAT AACAGAGTAC TCGCCTATGT ATAAACTTAC 

5001 TATTTAGAAA AATAAACAAA TAGGGGTTCC GCGCACATTT CCCCGAAAAG 
ATAAATCTTT TTATTTGTTT ATCCCCAAGG CGCGTGTAAA GGGGCTTTTC 

5051 TGCCACCTGA CGTCTAAGAA ACCATTATTA TCATGACATT AACCTATAAA 
ACGGTGGACT GCAGATTCTT TGGTAATAAT AGTACTGTAA TTGGATATTT 

5101 AATAGGCGTA TCACGAGGCC CTTTCGTC 
TTATCCGCAT AGTGCTCCGG GAAAGCAG 
SspI (5277)
XmnI (5072)
Asp700 (5072)
BcgI (5012)
PvuI (4843)
BsaI (4534)
Eam1105I (4473)
AspEI (4473)

PciI (3580)
BclI (3236)
MluI (3220)
BamHI (3211)
EcoRV (3205)
AscI (3194)

XhoI (3175)
PaeR7I (3175)
BstZ17I (3156)
Bst1107I (3156)

EcoNI (2867)
BstAPI (2852)

HindIII (190)
AatI (211)
StuI (211)

SfiI (260)

SnaBI (781)
EagI (1096)
EclXI (1096)
KspI (1096)
SacII (1096)
XmaIII (1096)
Ppu10I (1190)
NsiI (1194)
Bpu1102I (1260)
CelII (1260)
EspI (1260)
AccIII (1535)
BseAI (1535)
BspEI (1535)
MroI (1535)
AflII (1768)
BfrI (1768)
HpaI (1853)
SalI (1973)
AfeI (2061)
Eco47III (2061)
NheI (2062)
SexAI (2069)
BbrPI (2100)
PmaCI (2100)
PmlI (2100)
DraIII (2306)
PpuMI (2512)
ORF opti 330/S/^ 
SEQ ID NO: 4

1 TCGCGCGTTT CGGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCCG
AGCGCGCAAA GCCACTACTG CCACTTTTGG AGACTGTGTA CGTCGAGGGC 



51 GAGACGGTCA CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG 
CTCTGCCAGT GTCGAACAGA CATTCGCCTA CGGCCCTCGT CTGTTCGGGC 

101 TCAGGGCGCG TCAGCGGGTG TTGGCGGGTG TCGGGGCTGG CTTAACTATG 
AGTCCCGCGC AGTCGCCCAC AACCGCCCAC AGCCCCGACC GAATTGATAC 

HindIII



151 CGGCATCAGA GCAGATTGTA CTGAGAGTGC ACCATATGAA GCTTTTTGCA 
GCCGTAGTCT CGTCTAACAT GACTCTCACG TGGTATACTT CGAAAAACGT 

StuI 



AatI 



201 AAAGCCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG
TTTCGGATCC GGAGGTTTTT TCGGAGGAGT GATGAAGACC TTATCGAGTC

SfiI



251 AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA 
TCCGGCTCCG CCGGAGCCGG AGACGTATTT ATTTTTTTTA ATCAGTCGGT 

301 TGGGGCGGAG AATGGGCGGA ACTGGGCGGG GAGGGAATTA TTGGCTATTG 
ACCCCGCCTC TTACCCGCCT TGACCCGCCC CTCCCTTAAT AACCGATAAC 

351 GCCATTGCAT ACGTTGTATC TATATCATAA TATGTACATT TATATTGGCT
CGGTAACGTA TGCAACATAG ATATAGTATT ATACATGTAA ATATAACCGA 

401 CATGTCCAAT ATGACCGCCA TGTTGACATT GATTATTGAC TAGTTATTAA 
GTACAGGTTA TACTGGCGGT ACAACTGTAA CTAATAACTG ATCAATAATT 

451 TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA TGGAGTTCCG 
ATCATTAGTT AATGCCCCAG TAATCAAGTA TCGGGTATAT ACCTCAAGGC 

501 CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 
GCAATGTATT GAATGCCATT TACCGGGCGG ACCGACTGGC GGGTTGCTGG 

551 CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA 
GGGCGGGTAA CTGCAGTTAT TACTGCATAC AAGGGTATCA TTGCGGTTAT 

601 GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT AAACTGCCCA 
CCCTGAAAGG TAACTGCAGT TACCCACCTC ATAAATGCCA TTTGACGGGT 

651 CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTCCGCCC CCTATTGACG 
GAACCGTCAT GTAGTTCACA TAGTATACGG TTCAGGCGGG GGATAACTGC 

701 TCAATGACGG TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA 
AGTTACTGCC ATTTACCGGG CGGACCGTAA TACGGGTCAT GTACTGGAAT 

SnaBI 



751 CGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA TCGCTATTAC 
GCCCTGAAAG GATGAACCGT CATGTAGATG CATAATCAGT AGCGATAATG 
801 CATGGTGATG CGGTTTTGGC AGTACACCAA TGGGCGTGGA TAGCGGTTTG
GTACCACTAC GCCAAAACCG TCATGTGGTT ACCCGCACCT ATCGCCAAAC 

851 ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG 
TGAGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT ACCCTCAAAC 



901 TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ATAACCCCGC 
AAAACCGTGG TTTTAGTTGC CCTGAAAGGT TTTACAGCAT TATTGGGGCG

951 CCCGTTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA 
GGGCAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC CAGATATATT 

1001 GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG CCATCCACGC
CGTCTCGAGC AAATCACTTG GCAGTCTAGC GGACCTCTGC GGTAGGTGCG

XmaIII



SacII 



KspI



EclXI 



EagI



1051 TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC TCCGCGGCCG 
ACAAAACTGG AGGTATCTTC TGTGGCCCTG GCTAGGTCGG AGGCGCCGGC 

1101 GGAACGGTGC ATTGGAACGC GGATTCCCCG TGCCAAGAGT GACGTAAGTA 
CCTTGCCACG TAACCTTGCG CCTAAGGGGC ACGGTTCTCA CTGCATTCAT 

Ppu10I

NsiI



1151 CCGCCTATAG ACTCTATAGG CACACCCCTT TGGCTCTTAT GCATGCTATA
GGCGGATATC TGAGATATCC GTGTGGGGAA ACCGAGAATA CGTACGATAT

1201 CTGTTTTTGG CTTGGGGCCT ATACACCCCC GCTCCTTATG CTATAGGTGA
GACAAAAACC GAACCCCGGA TATGTGGGGG CGAGGAATAC GATATCCACT

EspI



CelII



Bpu1102I



1251 TGGTATAGCT TAGCCTATAG GTGTGGGTTA TTGACCATTA TTGACCACTC 
ACCATATCGA ATCGGATATC CACACCCAAT AACTGGTAAT AACTGGTGAG 

1301 CCCTATTGGT GACGATACTT TCCATTACTA ATCCATAACA TGGCTCTTTG 
GGGATAACCA CTGCTATGAA AGGTAATGAT TAGGTATTGT ACCGAGAAAC 

1351 CCACAACTAT CTCTATTGGC TATATGCCAA TACTCTGTCC TTCAGAGACT 
GGTGTTGATA GAGATAACCG ATATACGGTT ATGAGACAGG AAGTCTCTGA 

1401 GACACGGACT CTGTATTTTT ACAGGATGGG GTCCATTTAT TATTTACAAA 
CTGTGCCTGA GACATAAAAA TGTCCTACCC CAGGTAAATA ATAAATGTTT 







1451 TTCACATATA CAACAACGCC GTCCCCCGTG CCCGCAGTTT TTATTAAACA 
AAGTGTATAT GTTGTTGCGG CAGGGGGCAC GGGCGTCAAA AATAATTTGT 

MroI



BspEI 



BseAI 



AccIII 



1501 TAGCGTGGGA TCTCCGACAT CTCGGGTACG TGTTCCGGAC ATGGGCTCTT 
ATCGCACCCT AGAGGCTGTA GAGCCCATGC ACAAGGCCTG TACCCGAGAA 

1551 CTCCGGTAGC GGCGGAGCTT CCACATCCGA GCCCTGGTCC CATCCGTCCA 
GAGGCCATCG CCGCCTCGAA GGTGTAGGCT CGGGACCAGG GTAGGCAGGT 

1601 GCGGCTCATG GTCGCTCGGC AGCTCCTTGC TCCTAACAGT GGAGGCCAGA 
CGCCGAGTAC CAGCGAGCCG TCGAGGAACG AGGATTGTCA CCTCCGGTCT 

1651 CTTAGGCACA GCACAATGCC CACCACCACC AGTGTGCCGC ACAAGGCCGT
GAATCCGTGT CGTGTTACGG GTGGTGGTGG TCACACGGCG TGTTCCGGCA 

1701 GGCGGTAGGG TATGTGTCTG AAAATGAGCT CGGAGATTGG GCTCGCACCT 
CCGCCATCCC ATACACAGAC TTTTACTCGA GCCTCTAACC CGAGCGTGGA 

BfrI



AflII



1751 GGACGCAGAT GGAAGACTTA AGGCAGCGGC AGAAGAAGAT GCAGGCAGCT 
CCTGCGTCTA CCTTCTGAAT TCCGTCGCCG TCTTCTTCTA CGTCCGTCGA 

HpaI

1801 GAGTTGTTGT ATTCTGATAA GAGTCAGAGG TAACTCCCGT TGCGGTGCTG 
CTCAACAACA TAAGACTATT CTCAGTCTCC ATTGAGGGCA ACGCCACGAC 

HpaI



1851 TTAACGGTGG AGGGCAGTGT AGTCTGAGCA GTACTCGTTG CTGCCGCGCG 
AATTGCCACC TCCCGTCACA TCAGACTCGT CATGAGCAAC GACGGCGCGC 

1901 CGCCACCAGA CATAATAGCT GACAGACTAA CAGACTGTTC CTTTCCATGG 
GCGGTGGTCT GTATTATCGA CTGTCTGATT GTCTGACAAG GAAAGGTACC 

+3 SEQ ID NO: 5 O M D A 

SalI



1951 GTCTTTTCTG CAGTCACCGT CGTCGACGAA TTCAAGCAAT CATGGATGCA 
CAGAAAAGAC GTCAGTGGCA GCAGCTGCTT AAGTTCGTTA GTACCTACGT 

+ 3MKRG LCC VLL LCGA VFV 
2001 ATGAAGAGAG GGCTCTGCTG TGTGCTGCTG CTGTGTGGAG CAGTCTTCGT 
TACTTCTCTC CCGAGACGAC ACACGACGAC GACACACCTC GTCAGAAGCA 







+3 



P S 



A S Y Q 
NheI



Eco47III
AfeI



L Y H 

PmlI

PmaCI

BbrPI



SexAI

VTND CPN SSI
PmlI



+3 



PmaCI

~ =is ris ssss^T^isi 

— « r\ K 



+3 
2151 



- A T R D G K 
DCW VAMT P T V GACGGCAAGC 

" - sE^c ssss sss 



L P A T 



Q L R 



HID 



li V 



G S A 
DraIII



^3 TLCSALYV 

sss? sss?s =s 



+ 3 
2451 



^ D M M .M N W S P T JJ^^Jq/acATCACATC 

sssss sss ssss 4-"- 
+3 TRIL TIP QSLD SWW TSL
2551 CAAGAATCCT CACAATACCG CAGAGTCTAG ACTCGTGGTG GACTTCTCTC 
GTTCTTAGGA GTGTTATGGC GTCTCAGATC TGAGCACCAC CTGAAGAGAG 

+ 3 NFLG GSP VCL GQNS QSP 
2601 AATTTTCTAG GGGGATCTCC CGTGTGTCTT GGCCAAAATT CGCAGTCCCC 
TTAAAAGATC CCCCTAGAGG GCACACAGAA CCGGTTTTAA GCGTCAGGGG 

+ 3 TSN HSPT SCP PIC PGY 
2651 AACCTCCAAT CACTCACCAA CCTCCTGTCC TCCAATTTGT CCTGGTTATC 
TTGGAGGTTA GTGAGTGGTT GGAGGACAGG AGGTTAAACA GGACCAATAG 

+ 3RWMC LRR FIIF LFI LLL 
2701 GCTGGATGTG TCTGCGGCGT TTTATCATAT TCCTCTTCAT CCTGCTGCTA 
CGACCTACAC AGACGCCGCA AAATAGTATA AGGAGAAGTA GGACGACGAT 

+ 3 CLIF LLV LLD YQGM LPV 
2751 TGCCTCATCT TCTTATTGGT TCTTCTGGAT TATCAAGGTA TGTTGCCCGT 
ACGGAGTAGA AGAATAACCA AGAAGACCTA ATAGTTCCAT ACAACGGGCA 

+3 CPL IPGS TTT STG PCK 

BstAPI 



2801 TTGTCCTCTA ATTCCAGGAT CAACAACAAC CAGTACGGGA CCATGCAAAA
AACAGGAGAT TAAGGTCCTA GTTGTTGTTG GTCATGCCCT GGTACGTTTT

+3TCTT PAQ GNSM FPS CCC 
BstAP I EcoNI 



2851 CCTGCACGAC TCCTGCTCAA GGCAACTCTA TGTTTCCCTC ATGTTGCTGT 
GGACGTGCTG AGGACGAGTT CCGTTGAGAT ACAAAGGGAG TACAACGACA 

+ 3 TKPT DGN CTC IPIP SSW 
2901 ACAAAACCTA CGGATGGAAA TTGCACCTGT ATTCCCATCC CATCGTCCTG
TGTTTTGGAT GCCTACCTTT AACGTGGACA TAAGGGTAGG GTAGCAGGAC

+3 AFA KYLW EWA SVR FSW
2951 GGCTTTCGCA AAATACCTAT GGGAGTGGGC CTCAGTCCGT TTCTCTTGGC 
CCGAAAGCGT TTTATGGATA CCCTCACCCG GAGTCAGGCA AAGAGAACCG 

+3LSLL VPF VQWF VGL SPT 
3001 TCAGTTTACT AGTGCCATTT GTTCAGTGGT TCGTAGGGCT TTCCCCCACT 
AGTCAAATGA TCACGGTAAA CAAGTCACCA AGCATCCCGA AAGGGGGTGA 

+ 3 VWLS AIW MMW YWGP SLY 
3051 GTTTGGCTTT CAGCTATATG GATGATGTGG TATTGGGGGC CAAGTCTGTA 
CAAACCGAAA GTCGATATAC CTACTACACC ATAACCCCCG GTTCAGACAT 

+3 SIV SPFI PLL PIF FCL 
3101 CAGCATCGTG AGTCCCTTTA TACCGCTGTT ACCAATTTTC TTTTGTCTCT 
GTCGTAGCAC TCAGGGAAAT ATGGCGACAA TGGTTAAAAG AAAACAGAGA 

+ 3 W V Y I * 

BstZ17I XhoI



Bst1107I PaeR7I AscI



3151 GGGTATACAT TTAAGAATTC AGACTCGAGC AAGTCTAGAA AGGCGCGCCA 
CCCATATGTA AATTCTTAAG TCTGAGCTCG TTCAGATCTT TCCGCGCGGT 



FIG. 3F






EcoRV 



BamHI 



MluI



BclI



3201 AGATATCAAG GATCCACTAC GCGTTAGAGC TCGCTGATCA GCCTCGACTG 
TCTATAGTTC CTAGGTGATG CGCAATCTCG AGCGACTAGT CGGAGCTGAC 

3251 TGCCTTCTAG TTGCCAGCCA TCTGTTGTTT GCCCCTCCCC CGTGCCTTCC 
ACGGAAGATC AACGGTCGGT AGACAACAAA CGGGGAGGGG GCACGGAAGG

3301 TTGACCCTGG AAGGTGCCAC TCCCACTGTC CTTTCCTAAT AAAATGAGGA 
AACTGGGACC TTCCACGGTG AGGGTGACAG GAAAGGATTA TTTTACTCCT 

3351 AATTGCATCG CATTGTCTGA GTAGGTGTCA TTCTATTCTG GGGGGTGGGG 
TTAACGTAGC GTAACAGACT CATCCACAGT AAGATAAGAC CCCCCACCCC 

3401 TGGGGCAGGA CAGCAAGGGG GAGGATTGGG AAGACAATAG CAGGCATGCT
ACCCCGTCCT GTCGTTCCCC CTCCTAACCC TTCTGTTATC GTCCGTACGA 

3451 GGGGAGCTCT TCCGCTTCCT CGCTCACTGA CTCGCTGCGC TCGGTCGTTC 
CCCCTCGAGA AGGCGAAGGA GCGAGTGACT GAGCGACGCG AGCCAGCAAG 

3501 GGCTGCGGCG AGCGGTATCA GCTCACTCAA AGGCGGTAAT ACGGTTATCC 
CCGACGCCGC TCGCCATAGT CGAGTGAGTT TCCGCCATTA TGCCAATAGG 



3551 ACAGAATCAG GGGATAACGC AGGAAAGAAC ATGTGAGCAA AAGGCCAGCA 
TGTCTTAGTC CCCTATTGCG TCCTTTCTTG TACACTCGTT TTCCGGTCGT 

3601 AAAGGCCAGG AACCGTAAAA AGGCCGCGTT GCTGGCGTTT TTCCATAGGC 
TTTCCGGTCC TTGGCATTTT TCCGGCGCAA CGACCGCAAA AAGGTATCCG

3651 TCCGCCCCCC TGACGAGCAT CACAAAAATC GACGCTCAAG TCAGAGGTGG 
AGGCGGGGGG ACTGCTCGTA GTGTTTTTAG CTGCGAGTTC AGTCTCCACC 

3701 CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC CTGGAAGCTC 
GCTTTGGGCT GTCCTGATAT TTCTATGGTC CGCAAAGGGG GACCTTCGAG 

3751 CCTCGTGCGC TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG 
GGAGCACGCG AGAGGACAAG GCTGGGACGG CGAATGGCCT ATGGACAGGC 

3801 CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCAATGCTC ACGCTGTAGG 
GGAAAGAGGG AAGCCCTTCG CACCGCGAAA GAGTTACGAG TGCGACATCC 

3851 TATCTCAGTT CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT GTGTGCACGA 
ATAGAGTCAA GCCACATCCA GCAAGCGAGG TTCGACCCGA CACACGTGCT 

3901 ACCCCCCGTT CAGCCCGACC GCTGCGCCTT ATCCGGTAAC TATCGTCTTG 
TGGGGGGCAA GTCGGGCTGG CGACGCGGAA TAGGCCATTG ATAGCAGAAC 

3951 AGTCCAACCC GGTAAGACAC GACTTATCGC CACTGGCAGC AGCCACTGGT 
TCAGGTTGGG CCATTCTGTG CTGAATAGCG GTGACCGTCG TCGGTGACCA 

4001 AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG AGTTCTTGAA
TTGTCCTAAT CGTCTCGCTC CATACATCCG CCACGATGTC TCAAGAACTT 

4051 GTGGTGGCCT AACTACGGCT ACACTAGAAG GACAGTATTT GGTATCTGCG 
CACCACCGGA TTGATGCCGA TGTGATCTTC CTGTCATAAA CCATAGACGC 



Pci 



I 



FB 
4101 CTCTGCTGAA GCCAGTTACC TTCGGAAAAA GAGTTGGTAG CTCTTGATCC
GAGACGACTT CGGTCAATGG AAGCCTTTTT CTCAACCATC GAGAACTAGG 

4151 GGCAAACAAA CCACCGCTGG TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA 
CCGTTTGTTT GGTGGCGACC ATCGCCACCA AAAAAACAAA CGTTCGTCGT 

4201 GATTACGCGC AGAAAAAAAG GATCTCAAGA AGATCCTTTG ATCTTTTCTA 
CTAATGCGCG TCTTTTTTTC CTAGAGTTCT TCTAGGAAAC TAGAAAAGAT 

4251 CGGGGTCTGA CGCTCAGTGG AACGAAAACT CACGTTAAGG GATTTTGGTC 
GCCCCAGACT GCGAGTCACC TTGCTTTTGA GTGCAATTCC CTAAAACCAG 

4301 ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA ATTAAAAATG 
TACTCTAATA GTTTTTCCTA GAAGTGGATC TAGGAAAATT TAATTTTTAC 

4351 AAGTTTTAAA TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT 
TTCAAAATTT AGTTAGATTT CATATATACT CATTTGAACC AGACTGTCAA 

4401 ACCAATGCTT AATCAGTGAG GCACCTATCT CAGCGATCTG TCTATTTCGT 
TGGTTACGAA TTAGTCACTC CGTGGATAGA GTCGCTAGAC AGATAAAGCA 



4451 TCATCCATAG TTGCCTGACT CCCCGTCGTG TAGATAACTA CGATACGGGA 
AGTAGGTATC AACGGACTGA GGGGCAGCAC ATCTATTGAT GCTATGCCCT 

4501 GGGCTTACCA TCTGGCCCCA GTGCTGCAAT GATACCGCGA GACCCACGCT 
CCCGAATGGT AGACCGGGGT CACGACGTTA CTATGGCGCT CTGGGTGCGA 

Bsal 



4551 CACCGGCTCC AGATTTATCA GCAATAAACC AGCCAGCCGG AAGGGCCGAG 
GTGGCCGAGG TCTAAATAGT CGTTATTTGG TCGGTCGGCC TTCCCGGCTC 

4601 CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT CTATTAATTG 
GCGTCTTCAC CAGGACGTTG AAATAGGCGG AGGTAGGTCA GATAATTAAC 

4651 TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG 
AACGGCCCTT CGATCTCATT CATCAAGCGG TCAATTATCA AACGCGTTGC 

4701 TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC GTTTGGTATG 
AACAACGGTA ACGATGTCCG TAGCACCACA GTGCGAGCAG CAAACCATAC 

4751 GCTTCATTCA GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC
CGAAGTAAGT CGAGGCCAAG GGTTGCTAGT TCCGCTCAAT GTACTAGGGG 



4801 CATGTTGTGC AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA 
GTACAACACG TTTTTTCGCC AATCGAGGAA GCCAGGAGGC TAGCAACAGT 

4851 GAAGTAAGTT GGCCGCAGTG TTATCACTCA TGGTTATGGC AGCACTGCAT 
CTTCATTCAA CCGGCGTCAC AATAGTGAGT ACCAATACCG TCGTGACGTA 

4901 AATTCTCTTA CTGTCATGCC ATCCGTAAGA TGCTTTTCTG TGACTGGTGA 
TTAAGAGAAT GACAGTACGG TAGGCATTCT ACGAAAAGAC ACTGACCACT 



Eam1105I



AspEI 



PvuI







BcgI



4951 GTACTCAACC AAGTCATTCT GAGAATAGTG TATGCGGCGA CCGAGTTGCT 
CATGAGTTGG TTCAGTAAGA CTCTTATCAC ATACGCCGCT GGCTCAACGA 

5001 CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG CAGAACTTTA 
GAACGGGCCG CAGTTATGCC CTATTATGGC GCGGTGTATC GTCTTGAAAT 

XmnI 



Asp700 



5051 AAAGTGCTCA TCATTGGAAA ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT 
TTTCACGAGT AGTAACCTTT TGCAAGAAGC CCCGCTTTTG AGAGTTCCTA 

5101 CTTACCGCTG TTGAGATCCA GTTCGATGTA ACCCACTCGT GCACCCAACT 
GAATGGCGAC AACTCTAGGT CAAGCTACAT TGGGTGAGCA CGTGGGTTGA 

5151 GATCTTCAGC ATCTTTTACT TTCACCAGCG TTTCTGGGTG AGCAAAAACA 
CTAGAAGTCG TAGAAAATGA AAGTGGTCGC AAAGACCCAC TCGTTTTTGT 

5201 GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC GGAAATGTTG
CCTTCCGTTT TACGGCGTTT TTTCCCTTAT TCCCGCTGTG CCTTTACAAC

SspI



5251 AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT 
TTATGAGTAT GAGAAGGAAA AAGTTATAAT AACTTCGTAA ATAGTCCCAA 

5301 ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA 
TAACAGAGTA CTCGCCTATG TATAAACTTA CATAAATCTT TTTATTTGTT 

5351 ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCACCTG ACGTCTAAGA
TATCCCCAAG GCGCGTGTAA AGGGGCTTTT CACGGTGGAC TGCAGATTCT 

5401 AACCATTATT ATCATGACAT TAACCTATAA AAATAGGCGT ATCACGAGGC 
TTGGTAATAA TAGTACTGTA ATTGGATATT TTTATCCGCA TAGTGCTCCG 

5451 CCTTTCGTC 
GGAAAGCAG 